Acoustic and perceptual analysis of discontinuities in two TTS concatenation systems

Author

  • Jonas Lindh
Abstract

It is fair to say that L&H’s (now Scansoft’s) RealSpeak and AT&T’s NextGen are two of the most natural-sounding unit selection systems. The transitions between concatenated units sometimes contain discontinuities, which are among the greatest problems in the output of such systems. The discontinuities are often perceived as ‘jumps’, i.e. disturbances. The analyses in this paper investigate the acoustic properties of these ‘jumps’, whether they are perceived as disturbing and, if so, how disturbing. The results show that the selection criteria do not include enough information on single acoustic parameters, such as formants. Since listeners perceive discontinuities in formants, especially F2, as disturbing, one conclusion is that the next step in developing these systems must be to include more information on these parameters separately (especially formants 2 and 3) to improve the selection process. Other measures, such as increasing the database size, structuring the data better, and improving grapheme-to-phoneme conversion, can also improve the selection process, but those aspects are not dealt with here.
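The abstract's recommendation to treat formants as separate selection criteria can be illustrated with a minimal sketch. It is not taken from RealSpeak, NextGen, or the paper itself: the `Formants` container, the function names, the use of plain Hz differences, and the weights are all hypothetical, chosen only to show a join cost in which each formant contributes its own term and F2 and F3 are weighted more heavily.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class Formants:
    """Formant frequencies in Hz at one analysis frame (hypothetical container)."""
    f1: float
    f2: float
    f3: float


def formant_jumps(left_end: Formants, right_start: Formants) -> Dict[str, float]:
    """Absolute per-formant difference in Hz across a concatenation point."""
    return {
        "f1": abs(left_end.f1 - right_start.f1),
        "f2": abs(left_end.f2 - right_start.f2),
        "f3": abs(left_end.f3 - right_start.f3),
    }


# Illustrative weights only (an assumption, not values from the paper):
# F2 and F3 are emphasised because the abstract singles them out.
WEIGHTS = {"f1": 1.0, "f2": 2.0, "f3": 1.5}


def formant_join_cost(left_end: Formants, right_start: Formants,
                      weights: Dict[str, float] = WEIGHTS) -> float:
    """Weighted sum of the per-formant jumps; one separate term per formant."""
    jumps = formant_jumps(left_end, right_start)
    return sum(weights[name] * jumps[name] for name in jumps)


# Made-up frame values: last frame of the left unit, first frame of the right unit.
left = Formants(f1=500.0, f2=1500.0, f3=2500.0)
right = Formants(f1=520.0, f2=1700.0, f3=2450.0)

print(formant_jumps(left, right))
print(formant_join_cost(left, right))
```

In a real system such a term would be only one component of the overall join cost, and a perceptual frequency scale might be preferable to raw Hz; both choices are left open here.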


Similar resources

Data pruning approach to unit selection for inventory generation of concatenative embeddable Chinese TTS systems

In this paper, a data pruning approach is presented for building an acoustic unit inventory for a syllable-based concatenative embeddable Chinese TTS system. A 3-portion segmentation of a syllable is proposed based on the voiced/unvoiced structure of Chinese syllables. Individual factorial acoustic measurement of syllable is used to calculate the penalty of perceptual unsatisfactory for con...


Data-driven perceptually based join costs

Concatenative speech synthesis systems attempt to minimize audible discontinuities between two successive concatenated units. In unit selection concatenative synthesis, a join cost is calculated that is intended to predict the extent of audible discontinuity introduced by the concatenation of two specific units. A study was conducted that used human perceptual data on the detectability of mid-v...


Spectral Continuity Measures at Mandarin Syllable Boundaries

In Text-to-Speech (TTS) systems based on concatenative synthesis, the naturalness of synthetic speech is highly affected by the spectral continuity at the concatenation point. In this paper, we focused on 4 kinds of syllable boundaries in Mandarin and used several spectral distance measures combined with time-derivative distance measures to predict their audible discontinuities. A perceptual...


A new Japanese TTS system based on speech-prosody database and speech modification

This paper describes a new Japanese text-to-speech (TTS) system that can produce highly natural and intelligible synthetic speech. The good performance of the new TTS system derives from three new sophisticated approaches, as follows: (1) a new prosody control algorithm that uses prosody data extracted from a natural speech database and a duration control algorithm based on statistical estimation...


Applying the harmonic plus noise model in concatenative speech synthesis

This paper describes the application of the harmonic plus noise model (HNM) for concatenative text-to-speech (TTS) synthesis. In the context of HNM, speech signals are represented as a time-varying harmonic component plus a modulated noise component. The decomposition of a speech signal into these two components allows for more natural-sounding modifications of the signal (e.g., by using differ...



Publication date: 2004